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Introduction 


w-r/T he  discrete  Fourier  Transform  (OFT),  defined  below  m-42^4-,  is  a 
valuable  tool  in  many  fields  from  signal  orocessina  to  partial  differential 
equations.  There  is  strong  incentive  for  computino  the  transform  quickly, 
and,  after  two  decades  of  active  research,  we  now  know  the  minimum  number 
of  essential  multiplications  required  for  the  task  and  have  algorithms 
which  use  precisely  this  number.  It  does  not  follow  that,  in  the  end, 
these  will  be  the  most  desirable  techniques  but  they  are  certainly  of 
interest  in  their  own  right.  credit-  for  thfs-~fi~ne  work-seemsto--^— 

belong- to— L_d._  Good,  R.M.  Rader,- and  S.  Wineoradf  This  story,  and  more, 
is  told  in  the  book, -{5  J? 

At  the  heart  of  these  methods  lie  Winograd's  algorithms  for  n  point 
DFT's,  where  n  is  a  small  prime.  He  used  a  quick  way  of  forming  the 
product  of  two  polynomials  modulo  a  third  one  and  that  theory,  in  turn,  uses 
the  Chinese  Remainder  Theorem  and  some  abstract  algebra.  Section  5 
exhibits  the  correspondence  between  his  approach  and  ours. 

This  communication  points  out  that  these  optimal  algorithms  follow 
directly  from  the  spectral  factorization  of  certain  matrices  called 
circulants.  Specifically  the  real  and  imaginary  parts  of  the  eigenvectors 
of  these  circulants  take  the  place  of  polynomial  representations  in 
Winograd's  theory.  Such  an  approach  is  not  as  strange  as  it  may  seem 
at  first  glance;  if  an  arbitrary  column  vector  must  be  multiplied  by  a 
matrix  C  using  few  mul tipi ications  then  it  is  natural  to  see  whether  C 
can  be  written  as  C  *  X  D  Y  where  the  elements  of  X  and  Y  are  small 
integers  and  D  is  diagonal.  All  normal  matrices  can  be  factored  into  the 
canonical  form  XDX^  and,  for  some  small  circulants,  the  eigenvectors  X  of 
C  are  very  simple.  Another  connection  Is  that  I.J.  Good,  in  [3b],  showed 
how  to  use  the  DFT  to  invert  large  circulants.  We  are,  in  some  sense, 
inverting  that  work. 


•  ••  m  il 
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The  canonical  factorization  of  circulants  is  not  as  powerful  as  the 
number  theoretical  approach  of  Winograd.  For  example,  the  factorization 
does  not  yield  the  fast  algorithm  for  DFT(ll)  nor  the  proofs  of  minimality. 
On  the  other  hand  it  does  yield  the  algorithms  which  are  in  use  and  it 
does  so  in  a  rather  simple  way.  Some  people  may  enjoy  a  low  brow  deriva¬ 
tion  of  these  valuable  schemes. 

A  recent  survey  article  [2]  linked  the  DFT  with  an  impressive 
variety  of  topics  in  pure  and  applied  mathematics  and  yet  no  mention  was 
made  of  the  circulants  lurking  in  the  background. 

We  barely  mention  the  well  known  FFT  and  the  reader  is  referred  to 
[5]  to  see  its  connection  to  the  algorithms  discussed  here. 

2.  The  DFT  and  Cyclic  Convolution 

The  material  in  this  section  is  standard  but  must  be  included  if  only 
to  establish  the  notation. 

The  vector  or  sequence  {Xg,...,x  to  be  transformed  may  be 

thought  of  as  data  (complex  values)  given  at  n  equally  spaced  points.  The 
transformed  sequence  {xg,...,x  .j}  is  defined  by 


*  kj  in  1 

x .  ~  / .  to  x  • ,  ^  *  0» •  • .  »n-l  , 

K  j=0  J 


(2-D 


where 


o  *  =  exp(2-r  /"^1/n), 

Is  the  primitive  nth  root  of  unity.  Different  professional  groups  give 
somewhat  different  definitions  of  DFT  but  the  variations  are  minor.  For 


■ 
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example,  signal  engineers  call  (2-1)  the  inverse  OFT.  In  order  to 
suppress  indices  we  can  use  matrix  notation.  Let  x  and  x  denote  the 
column  vectors  associated  with  the  two  sequences  and  let  F  denote  the 
Fourier  Transform  matrix, 


F(u)  = 


1 

1 

1 

...  1 

1 

2 

n-1 

10 

CO 

...  CO 

1 

2 

4 

n-2 

<o 

CO 

...  w 

1 

n- 

• 

1 

•  *  *  • 

O) 

...  0) 

■  M 


(2-2) 


Note  that  the  order  of  F  is  implied  by  to.  Note  also  that  we  have 

used  the  relation  ion  =  1  to  reduce  all  the  exponents  in  F  below  n.  The 

definition  (2-1),  commonly  called  DFT(n),  now  becomes 


x  =  F(u)  )x. 
~  n~ 


(2-3) 


is  a  very  special  Vandermonde  matrix  and  a  well  known  property  is 


F4(o)n)  =  n2  I  . 


(2-4) 


The  letter  I  denotes  the  identity  matrix. 
The  Inverse  OFT  is  simple 


S  ■  i  F(Un-’)  x  , 


(2-5) 


The  eigenvectors  of  F  are  too  complicated,  see  [2],  to  yield  a  fast 
method  for  computing  the  OFT  so  we  turn  to  other  arrangements. 
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It  is  customary  to  remove  the  trivial  part  of  the  computation  of 
(2-3)  as  follows.  Since 

x0  s  x0  +  xi  +  •••  +  Vi 

the  essential  computation  is 


-  x, 


n-1 

■  £ 
j-i 


u) 


kj 


k  =  1  , . . . ,n-l  , 


(2-6) 


or,  equivalently, 

n-l  .. 

-  xn  =  £  (u)kJ  -  1)  x.  ,  k  =  1 . n-l  (2-7). 

K  u  j«l  3 

The  remarkable  fact  is  that  the  x^,  k*l,...,n-l,  can  be  computed  using 

2 

between  n  and  2n  multiplications  instead  of  (n-l)  as  suggested  by 
(2-6)  or  (2-7). 

In  1968  C.M.  Rader  pointed  out  in  [6]  that  when  n  is  prime  the 
variables  can  be  reordered  so  that  (2-6)  becomes  a  cyclic  convolution. 

Let  us  illustrate  this  for  n  =  5,  in  which  case  it  is  only  necessary  to 
exchange  the  last  two  variables. 


X1  "  xo 
x2  "  x0 
x4  ‘  x0 
*3  ‘  x0 
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00 

00 

0) 
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(2-8) 


Note  that  each  row  of  the  matrix  is  obtained  by  shifting  left  the  row 
above  in  a  cyclic  fashion.  The  new  ordering  is  monotonic  when  {1 ,2,3,4} 
is  seen  as  a  multiplicative  group  modulo  5: 


2°  21  22  23 

t  t  t  t 


(mod  5) 


Rader's  observation  reduces  the  OFT  computation  to  that  of  cyclic  convolu¬ 
tion.  There  are  several  clever  tricks  available  for  doing  this  quickly 
but  it  was  Winograd's  achievement  to  determine  the  minimal  number  of 
multiplications  that  are  needed  and  to  exhibit  the  algorithms  which  achieve 
the  minimum.  The  minimum  depends  quite  strongly  on  n.  It  also  depends  on 
the  algebraic  field  in  which  the  multiplication  is  taken  to  act  but  we  will 
not  focus  on  this  aspect  of  the  algorithms  which  is  treated  fully  in  [5]. 

Finally  we  remark  that  the  case  when  n  is  prime  is  the  important  one 
as  explained  in  Section  4. 

3.  Circulant  Multiplication. 

We  take  n  to  be  prime  and  rewrite  Rader's  observation  slightly. 
Instead  of  taking  both  the  {x ^ }  and  the  {x^}  in  the  same  new  ordering  we 
reverse  it  for  the  {x j } .  When  n  *  5  this  yields 
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Of  course  the  only  difference  from  (2-8)  is  that  each  row  of  the  matrix 
in  (3-1)  is  obtained  from  the  row  above  by  shifting  right  instead  of  left. 
The  advantage  is  that  the  matrix  in  (3-1)  is  a  circulant  matrix  and  a  great 
deal  is  known  about  them.  See  [1]  and  [4]. 

For  all  prime  n  Rader's  observation  reduces  DFT(n)  to  multiplying 
an  arbitrary  vector  by  a  special  circulant  matrix  of  order  n-1 .  We  now 
list  the  standard  facts  about  circulant  matrices. 


i 

'  FACT  1.  Every  m  x  m  circulant  is  a  polynomial  in  the  full  cycle 
(or  shift)  matrix 


0  0  .  .  1 

1  0  .  .  0 


1 


0 


.010 


(3-2) 


The  coefficients  of  the  polynomial  are  given  by  column  1  of  the  circulant. 

The  degree  is  less  than  m. 

There  is  nothing  to  prove  here  but  we  will  illustrate  the  property  using 
the  matrix  in  (3-1)  which  can  be  written  as 
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<f)(P^)  =  cjl  +  0)2P4  +  + 


3  p  3 

<*)  r.  . 


(3-3) 


0  4 

Note  that  I  =  P4  =  P4  and,  in  general,  note  the  special  pattern  of 

the  1 1 s  in  the  powers  of  P  . 

m 


| 

FACT  2.  Pm  is  a  permutation  matrix.  Its  eigenvalues  are  the  roots  of 

I 

unity  and  its  eigenvectors  are  mutually  orthogonal.  Moreover 

I 

Pm  is  real  and  so  it  has  a  real  canonical  form  A  =  A  which 
m  -  m 

i 

|  is  a  direct  sum  of  powers  of 


1 


(cos  6  -sin  0' 

sin  8  cos  0/ 


0  =  27r/m, 


together  with  1  and,  if  m  is  even,  -1  . 


I 


Corresponding  to  each  complex  eigenvalue  are  the  real  and  imaginary 

parts  of  the  eigenvector.  These  two  real  vectors  span  the  associated 

invariant  plane  and,  of  more  importance  to  us,  these  two  vectors  are  columns 

of  an  orthogonal  (real)  matrix  which  reduces  P  to  A  .  In  symbols, 

mm-' 

P  5  S  A  S  .T  .-1  ,  _  . . 

mmmm,S=S.  (3-4) 

Here  ST  denotes  the  transpose  of  S.  We  call  (3-4)  the  real  spectral 
factorization  of  P_  and  show  it  for  small  values  of  m  in  Table  1. 


[Table  1  could  go  near  here.] 
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Eigenvectors  are  only  defined  up  to  a  constant  nonzero  factor. 

Hence  there  is  no  loss  in  generality  in  writing 

S  =  GA  (3-5) 

where  A  is  diagonal  and  positive  definite  and  may  be  chosen  at  our 
convenience. 


FACT  3.  Let  <j>(P  )  be  any  m  x  m  circulant  and  let  =  (GA)  a  (GA)T 

be  the  real  spectral  factorization  of  the  cyclic  shift  matrix 

P  .  Then 
m 

<D(PJ  =  GA4>(  a  )  ( GA)T  (3-6) 

is  the  associated  spectral  factorization  of  $(P  ).  In  other 
words,  the  eigenvectors  of  Pffl  are  eigenvectors  of  <j>( Pm )  for 
any  polynomial  <p, 


Whenever  G's  elements  are  small  integers  then  (3-6)  provides  a 
minimal  multiplication  algorithm  for  forming  the  product  i>(Pm)x.  In  symbols, 

<|>(P  )x  =  GA4>(Pm)AGTx 

=  G(D(GTx)) ,  (3-7) 

where 

0  =  A<S(Pm)A 

is  block  diagonal  with  2x2  and  lxl  blocks.  Only  the  application  of 
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D  involves  genuine  multiplications,  G  and  G  act  via  additions  and 


subtractions. 


[Table  2  could  go  near  here.] 


The  only  way  that  the  OFT  affects  the  circulant  product  (3-7)  is 


through  the  complex  polynomial 


In  Table  2  we  list  <p  and  $(R^  ) 


for  several  small  values  of  m.  Recall  from  Fact  2  that  R  is  the  matrix 

m 

representing  rotation  through  an  angle  2-i/m.  It  turns  out  that  <j>(R  ) 
is  also  of  the  form  ~ g  )  and,  as  a  bonus,  B  and  y  are  either  both 
real  or  both  pure  imaginary.  Multiplication  of  a  real  vector  by  such  a 
matrix  requires  3  real  multiplications  and  3  additions.  This  holds  even 
when  B  and  y  are  matrices.  One  implementation  follows  from  the 
matrix  identity 


I  0  I 
I  I  0 


COO 


0  B+C  0 


0  0  B-C 


I  -I 


0  I 


I  0 


(3-8) 


In  general  (3-8)  is  preferable  to 


I  I 


■II  i  I 


sCB+iC)  0 
0  Js(B-iC) 


I  i  I 


I  -il 


(3-9) 


Another  useful  identity  is 


B  C\  /I  I\  /  »s(  B+C )  0 


I  -I 


0  »s(B-C) 


I  I 


I  -I 


(3-10) 
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which  is  just  a  block  version  of  (3-7)  when  m  =  2.  We  shall  use  (3-8)  and 
(3-10)  later. 

For  completeness  we  give  the  nonzero  elements  of  D  in  Table  3. 

These  numbers  are  the  values,  at  the  (n-1  )st  roots  of  unity,  of  a  polynomial 
of  which  the  coefficients  are  nth  roots  of  unity. 

[Table  3  could  go  near  here.] 


Our  interest  is  in  the  number  of  real  multiplications  required  to 
compute  DFT(n).  For  reasons  that  appear  in  the  next  section  we  make  the 
peculiar  definition 

mult(n)  =  1  +  no.  of  multiplications  required  for  DFT(n). 

If  n  is  prime  and  G's  elements  are  small  integers  then 

mult(n)  =  3 (n-1  )/2  =  2n  -  1  -  (n-1  )/2 

when  x  is  real.  For  complex  data  the  counts  are  only  doubled.  On  binary 
computers  the  count  can  be  reduced  by  1  when  n  =  3,5  because  multiplica¬ 
tion  by  h  or  h  can  be  accomplished  by  a  shift. 

It  is  clear  that  DFT(2)  and  DFT(4)  require  no  multiplications. 

However  in  dealing  with  larger  values  of  n  we  must  define 

mult(2)  =  2,  mult(4)  =  4. 

More  precisely 


mult(n)  =  max{n,  1  +  no.  of  multiplications  required  for  DFT(n)}. 
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In  order  to  obtain  a  systematic  development  of  the  G  matrices 
for  larger  values  of  m  it  is  worth  noting  that  the  factorization 
developed  in  this  section  extends  immediately  to  matrices  of  the  form 


Such  matrices  are  polynomials  in  the  orthogonal  matrix 


0  0  0  -1 


10  0  0 
0  10  0 
0  0  10 


The  only  difference  is  that  the  eigenvalues  of  P  are  in  the  roots  of  -1 

m 

(instead  of  +1),  The  factorization  (3-10)  reduces  a  circulant  of  order  2m 
to  a  direct  sum  of  a  circulant  of  order  m  and  one  of  these  "improper" 
circulants  of  order  m. 

In  order  for  G  to  have  small  integer  elements  it  is  necessary  that 
xm-l  have  no  irreducible  factors  (over  the  rational s)  of  degree  exceeding  2. 


4.  The  OFT  Factorization 

In  order  to  deal  nicely  with  DFT(n)  for  large  n  it  is  convenient  to 
recast  the  results  of  the  previous  section  as  a  factorization  of  the  DFT 
matrix  itself.  The  task  is  to  compute  F(un)x  =  x. 
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The  first  step  of  eliminating  Xg  and  Xg  can  be  seen  as  performing 
one  step  of  triangular  factorization  of  FUn),  This  leads  to  (2-7) 
rather  than  (2-6).  For  example. 


1  io  0  0  0 

1 !  0  0  0  0 

1 ! 

1 

0; 

1  '  r 

1 

O'  /  yv  .  . 

J  (cj  -1 ) 

0« 

1 

l! 

1 

Oj  u^=l, ...  ,4 . 

- 

.  m 

r.i 

1111 

1  0  0 

1 

1 

0 

4 

0 

Section  3  showed  that,  when  n  is  prime,  there  are  permutation  matrices 
rt-|  and  II2  such  that 


"lF(“n>"2 


1)0000 

1*0000 

1  h  i  i  1 

1! 

1 

0! 

1 

0! 

1 

l!  r 

G 

°i  0 

°|  GT 

l! 

1 

0« 

1 

0* 

1 

l! 

1 

0! 

t 

°i 

m  - 

* 

.  ^ 

(4-1) 


When  G's  elements  are  small  integers  this  leads  to  an  algorithm  with 
minimal  number  of  mul tipi ications .  The  D  matrices  exhibited  in 
Section  3  were  based  on  (2-6)  but  the  only  modification  needed  to  conform 
to  (2-7)  is  to  change  the  top  element  of  D  from  -l/(n-l)  to  -n/(n-l). 
Verification  of  this  assertion  is  left  to  the  reader. 

With  a  slight  abuse  of  notation  we  will  write  (4-1)  as 

*  gdgt. 


(4-2) 
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or 


F(u»n)  ^GDGT,  (4-3) 

and  we  call  this  the  OFT  factorization  of  F(<i)  ).  The  context  will  make 
it  clear  whether  F((jJn)  or  a  circulant  matrix  of  order  n-1  is  being 
factored. 

Note  that  ^  (equivalence  under  permutations)  is  a  true  equivalence 
relation. 

The  important  observation  of  I.J.  Good  in  [3a]  may  be  summarized  as 


Theorem.  If  l  and  m  are  relatively  prime  then 

F<V  '''F<'VeF<“„> 


(4-4) 


Here  ®  denotes  the  direct  (or  Kronecker  or  tensor)  product  of  matrices, 
namely 


A®  B 


aUB  al  2B 
a21 8  a22B 


•  • 


-14- 


Example:  F(wg)  ^  F(w^)  8>  F(w3) 


Algorithm:  x,  =  F(uO  x^  , 


X_  -  F  ( cu^ )  x_  i 


In  order  to  emphasize  the  theorem's  hypothesis  we  point  out  that 
F(u*g)  t  F(oj3)  S  F(u)3). 

Thus  a  2  dimensional  OFT  on  3x3  points  is  not  equivalent  to  a  1  dimensional 
OFT  on  9  points. 

There  is  a  routine  procedure  for  changing  the  DFT  factorization  (4-3) 
into  an  enlarged  form  GjOG-j  where  6  is  diagonal  (not  just  block  diagonal) 
and  has  an  order  not  less  than  0,  namely  mult(n).  It  uses  the  identity 
(3-8)  to  replace  (3  ~\)  by  d1ag(Y,6+Y,S-y) .  Note  the  G,  is  not  the 
transpose  of  G-j .  We  illustrate  the  procedure  when  n  *  5. 
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Figure  1.  Transition  to  strictly  diagonal  form. 
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The  implication  of  Good's  theorem  is  clear.  Given  fast  algorithms  for 
DFT(i)  and  DFT(m),  i.e. 


rW  '  Wt 


Then,  provided  i  and  m  have  no  common  factors. 


F(%)-F(u,)®  F(%). 


(4-5) 


yields  a  fast  algorithm  for  DFT(2m).  Formula  (4-5)  invokes  a  valuable 
property  of  direct  products,  see  [4]. 

The  number  of  multiplications  required  is  best  seen  from  the  full 
representation 


F<'V 


(s 


(2) 


& 


0  ) 


®  G 


(1). 


(4-6) 


Since  6.  ®  5  is  diagonal  of  order  mul t(£) -mul t(m)  we  have 
Z  m 


mult(Zm)  =  mul  t(2,) -mul  t(m) . 


(4-7) 


For  example,  mult(210)  =  mult(2-3-5-7)  =  324. 

We  say  that  (4-5),  or  (4-6),  is  the  DFT  factorization  of  F(u.). 

In  order  to  get  a  fast  DFT  algorithm  for  an^  natural  number  n  it 
is  necessary  to  exhibit  a  DFT  factorization  for  prime  powers  pr.  Unfor¬ 
tunately  this  becomes  very  messy  as  r  increases.  Circulants  do  appear 
but  they  have  to  be  combined  with  subblocks  which  involve  DFT(p  )  for 
k  <  r.  We  content  ourselves  with  exhibiting  the  DFT  factorization  for  n  =  9 
and  n  =  8  in  Table  4. 

[Table  4  could  go  near  here.] 

One  consequence  of  this  messiness  and  our  ignorance  of  mult(p  ) 
is  that  the  favorite  composite  numbers  for  the  DFT  are  those  with  a  variety 
of  prime  factors.  This  is  in  stark  contrast  to  the  FFT  of  Cooley  and 
Tukey  which  favors  n  ■  2  . 

This  concludes  our  elementary  presentation  of  Winograd’s  DFT.  There 
is  far  more  to  this  subject  than  we  have  indicated  here. 
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5.  Relation  of  Circulant  Multiplication  to  Winoqrad's  Formulation 

It  is  instructive  to  see  in  detail  the  correspondence  between  our 
factorization  of  the  circulant  and  Winograd's  derivation  of  cyclic 
convolution  using  polynomials. 

Let  u  be  the  indeterminate  in  the  polynomials. 

Cyclic  convolution  on  n  points  can  be  rephrased  as  the  formation 
of  the  product  of  two  arbitrary  polynomials  modulo  the  polynomial  un~^  -  1. 
This  can  be  done  rapidly  with  the  aid  of  the  Chinese  Remainder  Theorem. 

For  simplicity,  we  take  n  =  5  and  define 

2  3 

X(u)  =  X.|  +  X^u  +  X4U  +  X^U  , 

.  2  42  33 

ft(u)  =  u)  +  (jJU+u)U  +  WU, 

~  ~  ~  ~  2  ~  3 

X(u)  =  x1  +  x^u  +  x^u  +  x^u  . 

The  object  is  to  find  X  given  any  X  and  a  fixed  ft.  It  turns  out  that 
X  =  ftX  mod  (u4-l ) . 

The  fast  algorithm  requires  knowledge  of  the  irreducible  factors 
of  u"'1  -  1 .  In  our  case 

u4  -  1  =  (u-1 )  (u+1 )  (u2+l ) , 
i  (u)  Q2(u)  Q3(u), 

defining  ,  Q2,  and  Q^.  Also  needed  are  another  set  of  polynomials,  of 
degree  less  than  4,  which  are  defined,  for  k  *  1,2,3,  by 


Vu)  * 


mod  Qk, 
mod  Qj,  j  f  k. 
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The  Sk  are  analogous  to  the  fundamental  polynomials  for  Lagrangian 
interpolation  in  that  they  serve  to  reconstruct  a  polynomial  given  only 
its  values  "at"  the  Qk>  The  word  "at"  should  be  construed  as  "modulo" 
in  the  previous  sentence.  The  existence  of  the  Sk  is  the  essence  of 
the  Chinese  Remainder  Theorem. 

Finally  define  dk(u)  =  il(u)  mod  Qk>  k  *  1,2,3. 

Now  we  can  formulate  the  algorithm  succinctly. 

Phase  1  (called  the  preweave  by  signal  engineers): 

Form  =  X  mod  Q^,  k  =  1,2,3. 

Phase  2  (multiply): 

Form  Xk  =  mod  Qk,  k  =  1,2,3. 

Phase  3  (called  the  postweave  or  recovery): 

Form  X  =  A1S1  +  +  X3$3. 

When  the  Sk  have  Q,±l  coefficients  then  Phase  3  needs  no  multiplica¬ 
tions.  By  the  Remainder  theorem  X^  =  X(l),  X^  3  X ( -1  ) ,  X3  =  X^  + 
X^  u,  X  3^  1  xi  '  x4>  etc- 

Let  us  determine  the  Sk  and  ,Tk  explicitly.  It  is  easy  to 

veri  fy  that 

S-|  *  ‘VQjQj  *  +  u  +  u2  +  u3), 

52  *  "^1^3  *  ^0  -  u  +  u2  -  u3), 

53  *  -  SO  -  u2  ). 

Since  u>  *  exp(2-ri/5)  a  little  algebra  yields 


When  the  steps  are  laid  out  in  matrix  form  the  connection  with  circulant 
factorization  is  apparent.  (See  Table  1.) 
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Phase  1 


1111 
1-1  1-1 
10-10 
0  10-1 


Phase  2. 


Q1  0  0  0 
0  0  0 
0  0 

o  o  ji3^) 


Phase  3. 


1110  k  0  0  0 

1-10  1  0  W  0  0 


1  1  -1  0  0  0  ^0 
1  -1  0  -1  0  0  0  h 


f.  f  A  JL 

S1  S2  S3  S3U 


To  recover  the  polynomial  X  in  Phase  3  it  is  only  necessary  to  pre- 

2  3 

multiply  by  the  row  vector  (1  u  u  u  ).  Of  course  the  constants 


\  are  absorbed  into  the  matrix  in  Phase  2, 


-2 


Table  1 


"Pm  =  (GA)  a  (GA)T" 

Real  spectral  factorization  of  the  cyclic  shift  matrix" 
Notation:  A  @  B  =  diag(A.B)  =  direct  sum  of  A  and  B. 


,  b  =21,  A  = 


1  0] 

I 

0  -1  j 


3:  G 


2  0 


1-11, 

1  -1  -1 

y 


_2  cos  ^  -Sln 

=  diag(3,6,2),  A  =  1  $  3  3 

sin  ^  cos  ^ 


4:  G  = 


1110 
1-10  1 
11-10 
1-1  0-1 


=  d i a  g ( 4 ,4,2 ,2 ) , 


0  -1  ' 
1  0; 


6:  G  = 


1  1  2  0  2  0 
1-1  1  1-1  1 

1  1-1  1-1-1 

1-1-2020 
1  1  -1  -1  -1  1 

1-1  1-1  -1  -1 


diag(6,6 ,1 2 ,4 ,1 2 ,4) , 

f  1  0  1  2 

®  R  ®  R  , 

0  -1  J 

cos  t/3  -sin  tt/3 
sin  tt/3  cos  tt/3 
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Table  2 


"The  circulant  polynomial  arising  from  DFT(m+l)." 

_  f  0  -1  ) 

a.  =  exp(2’T  /-I  /j  ) .  J  =  j  . 

J  1  0  j 


4>(t)  =  ui,  +  a).*2 1 

J>(A)  =  0(1  )  +  *p(-l )  =  -1  ®  /IJ 


m  =  4:  $(t)  =  w5  +  cu52t  +  u>54t2  +  uv3t3 

0(1)  =  -1,  0(-l)  =  2(cos  2tt/5  -  cos  4-t/5)  =  2(cos  ^  +  cos  y) 

sin  2t/5  -sin  4tt/5' 

0(R)  =  2/-1 

sin  4-it/5  sin  2tt/5 

m  =  6:  0(t)  =  ay  +  oy2t  +  cy2t2  +  uy^t2  +  uy4t4  +  iy2t2 

0(1)  =  -1 ,  0(-l )  =  2/-1  (sin  ~  +  sin  ^y  -  sin  -^y) , 

0(R)  =  ( 2co  +  oj2  -  uj2  -  2co^  -  to4  +  w^)I  +  /3  (co2  +  uj2  -  co4  - 

=  /-H(2  sin  -  sin  ^y  +  sin  ^y)I  +  v/3(sin  ^y  +  sin  ^y)J;, 

=  /2T{(sin  y  +  2  sin  ^y  -  sin  -^y)I  +  vT(sin  y  +  sin  ^y)J}. 

0(R  )  =  oyl  +  cy  R  -  cy  R  +  cy^I  +  cy4R2  -  uy2R 

=  Jj(2(iJ  -  (A)  -  UJ  +  2a>^  -  Cl)4  -  U)5)I  +  -y  ( s'  -  u)2  +  a)4  -  U)5)J, 
=  (2  COS  — y  -  COS  — y  -  C0S^y)I  +  /T(  -  COS  —  +  COS  0 , 

=  (COS  y  +  2  COS  — y  +  COS  — y) I  +  /T(  -  COS  y  +  COS  ^y)  J . 
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Table  3.  "D  =  A<>(a)a" 
A<£B  =  diag(A,B),  i  =  /-l . 


n  =  3:  D  = 


n  =  5 :  D 


1  ”  ? 


n  =  7:  D  = 


^  &  j(-sin  j  +  sin  +  sin  ^y)i  * 


-  8  -Y  ' 
‘  Y  3  . 


n  1  ,  .  TT  -  .  2lT  .  3tTv 

0  =  yy  (Sln  7  +  ^  Sin  —  -  sin—), 

Y  =  ^  (sin  j  +  sin  -y) , 

6  =  yjr  (COS  y  +  2  COS  ~y  +  COS  ^y)  , 

1  t  IT  3lT\ 

n  =  y  (-COS  y  +  COS  -y)  . 

B+y  =  g-  (  2  sin  y+  sin  —  +sin  — ) 

n  1  ,  •  it  .  2tt  0  3itn 

3-y  =  g-  (  -sin  y  +  sin  —  -  2  sin  — ) 

6+n  -  j  (-  cos  y  +  cos  —  +2  cos  — ) 

6-n  =  g-  (2  COS  y  +  cos  —  -  cos  — ) 


Note  1.  Some  minor  modifications  will  be  given  in  Section  4. 


Note  2.  All  angles  have  been  reduced  to  the  range  (0,ir/2). 


1 
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Table  4a. 


The  DFT  Factorization  for  n 


o 

X 

"l  1 

1 

1 

1 

1 

1 

1 

1 

A 

xo 

1  1 

1 

3 

6 

3 

6 

3 

6 

x3 

00 

00 

00 

CO 

Ou 

to 

X3 

1  1 

1 

6 

3 

6 

3 

6 

3 

XG 

00 

oo 

00 

CO 

oo 

■oo 

x6 

1  3 

1  to 

6 

5 

7 

8 

4 
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X1 

00 

00 

to 

to 

00 

to 

to 

X1 

1  6 

1  td 

3 

2 

5 

7 

8 

4 

x0 

= 

00 

00 

(0 

to 

CO 

to 

/o 

x5 

1  3 

1  OJ 

6 

4 

2 

5 

7 

8 

x4 

00 

00 

00 

00 

CO 

oo 

to 

x7 

1  6 

1  (O 

3 

8 

4 

2 

5 

7 

x8 

OJ 

00 

to 

to 

CO 

00 

to 

CO 

X 

1  3 

1  0J 

6 

7 

8 

4 

2 

5 

X7 

oo 

oo 

00 

00 

CO 

00 

to 

x4 

1  6 

1  0J 

3 

5 

7 

8 

4 

2 

LO 

X 

00 

00 

to 

00 

oO 

oo 

to 

X2  J 

0  ; 


I 


F<“3) 


111 

00G 

000 


000000" 

101010 

010101 


,(0) 


.(1) 


010 

001 

010 

001 

010 

001 


:  J 

L  ! 

J 

l  J 

- 

1 

fin;  oooooo' 

1 

F(<o-) ; 

o  ;  o 

000!  101010 

i 

■ 

000' 01 01 01 

I 

0  ! 

I 

1 

[ 

! 

’6  ! 

1 

0  G6T 

1 

i 

i 

I 

_ L_ 

1 

_ 1 _ 

1 

_ l _ 

o  j 

i 

c 

!f(co3) 

0 

.(0) 


,0) 


*p(t) 


CO  + 


w5t 


(0?t 


V* 


U)4t 


Ugt' 


exp(2u  /-1/9), 


Ag  =  -1  (h  a/-l  *  ^  Y  ^  /-I  $  ^  ^  ,  see  Table  1 . 


mul  t(  9)  =3  +  3  +  3  =  14.  06  =  -^(  Ag)  - 
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Table  4b.  The  DFT  Factorization  for  n  =  2 . 


1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

-1 

-1 

-1 

-1 

1 

-1 

-1 

i 

i 

-i 

-i 

1 

-1 

-1 

-i 

-i 

i 

i 

-1 

i 

-i 

“(X) 

-iu 

i  oo 

-1 

i 

-i 

-a) 

a) 

iu 

i  oo 

-1 

-i 

i 

-la) 

ia> 

a> 

-O) 

-1 

-i 

i 

iw 

-  i  oo 

-U) 

0) 

11111111 
1  1  1  1  -1  -1  -1  -1 

1  1  -1  -1  0  0  0  0 

0  0  0  0  1  1  -1  -1 

1  -1  0  0  0  0  0  0 
0  0  1  -1  0  0  0  0 

0  0  0  0  1  -1  1  -1 

0  0  0  0  1  -1  1  -1 


where  a  =  1//2  =  cos  t/l  =  sin  V4,  i  =  /-I 


The  cyclic  convolution  on  {x^  .x^}  does  not  lead  to  an  optimal 

scheme  here. 


mul t(8)  =  max  {8,2}  =  8. 


I  X 
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